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Abstract 

Until recently, LP relaxations have only played a very limited role in the design of approximation 
algorithms for the Steiner tree problem. In particular, no (efficiently solvable) Steiner tree relaxation was 
known to have an integrality gap bounded away from 2, before Byrka et al. [3] showed an upper bound 
of sa 1.55 of a hypergraphic LP relaxation and presented a ln(4) + e w 1.39 approximation based on 
this relaxation. Interestingly, even though their approach is LP based, they do not compare the solution 
produced against the LP value. 

We take a fresh look at hypergraphic LP relaxations for the Steiner tree problem — one that heavily 
exploits methods and results from the theory of matroids and submodular functions — which leads to 
stronger integrality gaps, faster algorithms, and a variety of structural insights of independent interest. 
More precisely, along the lines of the algorithm of Byrka et al. [3], we present a deterministic ln(4) + e 
approximation that compares against the LP value and therefore proves a matching ln(4) upper bound on 
the integrality gap of hypergraphic relaxations. 

Similarly to [3], we iteratively fix one component and update the LP solution. However, whereas 
in [3] the LP is solved at every iteration after contracting a component, we show how feasibility can be 
maintained by a greedy procedure on a well-chosen matroid. Apart from avoiding the expensive step 
of solving a hypergraphic LP at each iteration, our algorithm can be analyzed using a simple potential 
function. This potential function gives an easy means to determine stronger approximation guarantees 
and integrality gaps when considering restricted graph topologies. In particular, this readily leads to a 
g| « 1.217 upper bound on the integrality gap of hypergraphic relaxations for quasi-bipartite graphs. 

Additionally, for the case of quasi-bipartite graphs, we present a simple algorithm to transform an 
optimal solution to the bidirected cut relaxation to an optimal solution of the hypergraphic relaxation, 
leading to a fast g| approximation for quasi-bipartite graphs. Furthermore, we show how the separation 
problem of the hypergraphic relaxation can be solved by computing maximum flows, which provides a 
way to obtain a fast independence oracle for the matroids that we use in our approach. 



*E-mail: goemans @math . mit . edu. Supported by NSF grants CCF-1 1 15849 and CCF-0829878, and by ONR grant N00014- 
11-1-0053. 

tE-mail: olver@math.mit.edu. Supported by NSF grant CCF-1 1 15849. 

*E-mail: rothvossOmath .mit . edu. Supported by the Alexander von Humboldt Foundation within the Feodor Lynen 
program, by ONR grant N00014-1 1-1-0053 and by NSF contract CCF-0829878. 

§E-mail: ricoz@math . mit . edu. Supported by NSF grants CCF-1 1 15849 and CCF-0829878, and by ONR grants N00014- 
11-1-0053 andN00014-09-l-0326. 



1 Introduction 



The Steiner tree problem is one of the most fundamental and important problems in Computer Science and 
Operations Research. Whereas a 2-approximation is easily obtained by computing a minimum spanning tree 
over the terminals, obtaining algorithms with an approximation guarantee bounded away from 2 has proven 
to be a non-trivial task. The problem is known to be inapproximable to within ||, unless NP = P [1, 8]). 
There has been a long sequence of combinatorial approximation algorithms [9, 24, 12, 19, 21], based on 
different greedy approaches, culminating in the famous 1 + + e < 1.55 approximation of Robins and 
Zelikovsky [21]. No further progress was achieved until Byrka, Grandoni, RothvoB and Sanita [3] presented 
the first LP-based approach leading to a ln(4) + e f« 1.39 approximation. A major hindrance in the design of 
LP-based Steiner tree algorithms is a rather poor understanding of potential LP relaxations. In particular, 
until the result of [3] , for no (efficiently solvable) LP relaxation of the Steiner tree problem was it known 
whether the integrality gap was bounded away from 2. Intriguingly, even though their ln(4) + e approximation 
algorithm is based on a particular LP relaxation, its approximation guarantee is not with respect to the LP 
solution and does not imply a ln(4) integrality gap for the relaxation. In [3], the authors show a weaker 
?a 1.55 integrality gap using a technique not directly linked to their algorithm. Chakrabarty et al. [7] provide 
a simpler alternative proof of the same bound. 

The linear relaxation used by Byrka et al., the directed component-based relaxation, was introduced 
by Polzin and Vahdati-Daneshmand [18], based in turn on an equivalent undirected component-based LP 
introduced by Warme [23]. It is the undirected version that we will use in this paper. Another notable 
relaxation is the partition-based LP introduced by Konemann et al. [13]. In [6], Chakrabarty et al. showed 
that this relaxation is equivalent to the others mentioned above, and introduced the term "hypergraphic" for 
this family of relaxations. They also proved that basic solutions are sparse, having support size less than the 
number of terminals. 

The limited understanding of LP relaxations of the Steiner tree problem is arguably a major barrier in 
the design of stronger approximation algorithms. The goal of this work is to fill this gap by providing a 
fresh view on the component-based LP relaxation — one that heavily exploits methods and results from the 
theory of matroids and submodular functions. More precisely, based on the approach of Byrka et al. [3], we 
present a deterministic ln(4) + e algorithm that starts with a solution to the component-based LP relaxation, 
iteratively contracts a component and updates the LP solution. The algorithm of Byrka et al. solves the 
component-based LP (through a very large extended formulation) in each iteration after contracting, in order 
to again obtain a feasible solution. By contrast, we show how the LP can be modified by a simple greedy 
algorithm over a well-chosen matroid to achieve the same goal. This leads to a considerably faster way to 
update the LP, but more importantly, we show how the approximation quality of our approach can be analyzed 
with respect to the initial LP solution. This implies a bound of the integrality gap of the component-based LP 
relaxation of ln(4). By comparison, the best known lower bound is 8/7 w 1.142 (e.g., by the example of 
[13]). Furthermore, we show how the separation problem of the component-based relaxation can be reduced 
to computing maximum flows. Whereas this result is likely to be of independent interest, it also provides a 
way to obtain a fast independence oracle for the matroids that we use in our approach. 

Additionally, we further investigate the special case of quasi-bipartite graphs, which has played a central 
role in the design of approximation algorithms for the Steiner tree problem, as well as to find APX-hard 
problem classes. Rajagopalan and Vazirani [20] showed that the integrality gap of the bidirected cut relaxation 
for such graphs can be bounded by 3/2. This was later improved to 4/3 [5] and to 1.28 [7]. We obtain a 
bound for the integrality gap, again matching the approximation factor of [3]. Such a bound was previously 
known only for the case when all edge costs are equal [7]. Chakrabarty et al. [6] showed that on quasi-bipartite 
graphs, the bidirected cut and hypergraphic relaxations are actually equivalent. However their proof is based 
on a duality argument, and they leave as an open problem the question of converting a solution from the 
bidirected cut relaxation to the hypergraphic relaxation efficiently (more quickly than simply optimizing the 
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hypergraphic LP). We present a simple algorithm to perform this transformation; since the bidirected cut 
relaxation can be solved much more efficiently via a compact extended formulation, this gives a much faster 
method of solving the hypergraphic LP in the quasi-bipartite case. Combining this result with the suggested 
approximation algorithm, we obtain a significantly faster approximation than the one of Byrka et al. [3], 
since we do not need to (repeatedly) optimize the component-based relaxation by using either the ellipsoid 
method or a very large extended formulation. 

2 Discussion of results and techniques 
2.1 The component-based LP 

Let G = {V,E)h& an undirected graph with terminals R(1V and edge costs c : E ^ M+. A component C 
is simply a subgraph of G with the property that it is a tree spanning y(C), all leaves of C are terminals, and 
all internal nodes are non-terminals. Write cost(C) := X]ee£(C) '^(^) '^^^^ ^ component C. We 

will frequently need the terminal set of a component C G /C, and so by abuse of notation, when we refer to 
C as a vertex set, we mean the set V{C) r\ Roi terminals in C. In particular, \C\ refers to the number of 
terminals in G. 

Now let /C be the set of all components of G; we assume that all components contain at least two terminals, 
else they can be safely removed. We use the notation {Z)'^ := max{Z, 0}. Then the component-based LP 
relaxation is as follows [23] : 

min Xc cost(C) 

CgK. 

Y,^c{\snG\-iy < \s\-i 

C&K. 

Y,xc{\C\-l) = \R\-l 
xc > 

Borchers and Du [2] showed that the optimal k-restricted Steiner tree, meaning only components with at 
most k terminals can be used, has cost at most 1 + l/[log2 k\ times the cost of an optimal Steiner tree. 
Furthermore, when restricting the variables in (LP) to components with at most k terminals, the resulting 
linear program can be solved efficiently, e.g., by solving a polynomial-size extended formulation [3]. It 
follows that for any fixed e > 0, a (1 + e)-approximate solution to (LP) can be obtained efficiently. We also 
point out in Appendix E that optimizing (LP) exactly is strongly NP-hard (this does not seem to have been 
previously observed). 

The framework of our algorithm is similar to Byrka et al. [4], and in particular, it is iterative in nature. 
They begin by computing a near-optimal fractional solution x to (LP). They then sample a component C at 
random, proportional to its entry xc, and contract this component. The solution x is no longer feasible to 
(LP) on this new contracted instance, so they re-solve the LP and iterate this procedure until all terminals are 
connected. 

In their analysis, they show that a single random contraction reduces the cost of the optimum Steiner 
tree by a certain factor in each iteration. The crucial ingredient is a lower bound on the expected cost of 
edges that could be removed from an optimum solution after a contraction, while still obtaining a Steiner 
tree. To obtain a bound on the integrality gap, we need a stronger result that says even & fractional solution 
becomes significantly cheaper after a random contraction. Even for a fixed set of terminals Q, it was unclear 
how to modify a fractional solution in order to preserve feasibility after contraction — a question that had a 
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simple answer in the integral case. Our first goal will be to obtain an understanding of the structure of these 
modifications. 

While it can be avoided, it significantly simplifies the discussion to consider "blown up" versions of 
solutions to (LP). Consider any x G Q'^, and let G N be such that • S N for all C G /C. The minimal 
blowup graph corresponding to x is the unweighted multigraph defined as follows. First take the disjoint 
union of xc • N disjoint copies of C for each component C; then identify, for each v ^ R, all the copies 
of V. The edge costs of X are inherited from G in the obvious way. See Figure 1 for an example of an LP 
solution and its associated minimal blowup graph. Observe then that cost(Af) = N • cost(x). Note that X 
(along with N, but this will remain fixed throughout) encodes all the information in x. In particular, given X 
we can determine all of its components: these are simply the maximal connected subgraphs that are trees 
whose leaves are precisely the terminals spanned. Thus we can define T{X) as the set of components of a 
blowup graph X. Each component C € T{X) is a subgraph of X, but again, we will abuse notation when the 
context is clear and sometimes use C to refer to just the terminals of C. Thus, e.g., for some S C. R, S Ci C 
refers to the terminals in C that are also in S. 

We will need slightly more generality in our definition of a blowup graph. For any t G N, let Gt be the 
multigraph obtained by first taking t disjoint copies of G, and then for each v £ R, identifying all copies of v. 
For a solution y with corresponding minimal blowup graph y, we call a multigraph y' a (not necessarily 
minimal) blowup graph corresponding to y if (i) y' C Gt for some t G N, (ii) y' 5 y, and (iii) for any 
distinct terminals u,v ^ R, there is no u-v-path in y' that is not already present in y. Any edges in y' that 
were not in 3^ we call pendant edges. We will say that a blowup graph y is feasible if it corresponds to a 
feasible solution to (LP); otherwise we call it infeasible. Note that pendant edges have no effect on feasibility; 
they will always be removed in what we will later call a "cleanup" step. 

2.2 Edge removal after contraction 

Let X be the blowup graph corresponding to some solution x. We are interested in the situation after 
contracting some full component of G. In order to avoid some annoying technicalities, for now instead of 
contracting Q we will think of increasing the value of xq by 1. In other words, in terms of the blowup 
graph, we take N fresh copies of component Q and add it to X. We denote the new blowup graph obtained 
hy X ® Q. Formally, X ® Q is obtained by taking the disjoint union of X and N copies of Q, and then 
identifying all copies of v for each v ^ R. 

It is clear that X ® Q is not feasible. We are interested in describing the set of edges F C E{X) that can 
be removed so that {X ® Q) — F is feasible. 

This is the primary reason that it is simpler to work with the blowup graph X rather than x; this 
modification operation is much simpler than an equivalent operation defined on x. For example, removing a 
single edge from X can have the effect of splitting up some component G into subcomponents Ci and C2; 
the corresponding effect on x is to reduce xc by and increase xc-^ and xc2 by the same amount. 

Unfortunately, the set of all possible edge removals is not so well behaved. In order to expose the structure 
we need, we must consider minimal removals. Let 

Bq = {B C E{X) I {X ® Q) — B is feasible, and B is minimal with this property}. 

Figure 2 shows an example; after a set B G Bq is removed, an edge of the blowup graph becomes pendant, 
and so can also be removed without affecting feasibility. 

One of the most crucial elements of our analysis is the following: 

Theorem 2,1, For every component Q, Bq forms the set of bases of a matroid Mq. 
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In particular, it follows that any minimal removal set has the same number of edges; this number turns out 
to be A^dQI — 1). We are able to give a precise description of the matroid Mq by giving its rank function; 
more details of this will be given in Section 3. We can also show that the matroid is a gammoid (a special type 
of matroid related to flows); see Appendix A. As an aside, we note that Mq depends only on the terminals of 
Q, and not its structure; we could actually define a matroid Ms for any subset S of terminals, but this will 
not be important for our purposes. 

We will now study which edge sets can be removed after the random contraction of a component. Even 
though we will finally present a deterministic algorithm, this analysis will be helpful in guaranteeing the 
existence of removal sets with certain properties by an averaging argument. 

As before, let X be the blowup graph corresponding to a feasible LP solution x. Upon contracting 
component Q, we may remove some edges in order to again obtain a feasible solution. In particulai", by 
Theorem 2.1, we can remove any basis of Mq. For added flexibility, we allow choosing a basis Bq G Bq 
randomly, according to any distribution we like. In this case, each edge e will be removed with some 
probability q^. The probability vectors that are attainable are simply the convex combinations of incidence 
vectors of the bases; in other words, precisely the vectors in B{Mq), the base polytope of Mq. 

Now consider, as in [4], randomly contracting a single component, with component Q G T{X) contracted 
with probability l/|r(Af)|. Note that since each original component Q ^ K, has Nxq copies in this is 

the same as contracting a component in K, with probability proportional to Xq. Again, we allow ourselves to 
choose an arbitrary distribution over Bq for removals on contracting Q, and ask what probability vectors p 
describing edge removal probabilities are attainable. But any such probability vector is given by some convex 
combination \y{x)\ '^Qev{x) where q'^ G B{Mq). In other words, the attainable probability vectors 
form precisely the polytope Brem given by the Minkowski sum 

This implies that Brem is a polymatroid [14]; from our knowledge of the rank functions of the Mq's, we can 
also describe the rank function of Brem, as will be described in detail in Section 3. 

In the following, we use scaled cost to refer to costs reduced by a factor of N, compensating for the 
blowup factor. The goal is to show that the expected scaled cost of removed edges is large, compared to the 
expected cost of the component that is contracted. Perfection would be if we could always remove edges of 
total scaled cost as large as the cost of the contracted component, but of course this is not possible (it would 
imply an integrality gap of 1). Thus we lower our goals slightly. It is possible to show that there is a point 
p G Brem with p^ > ^|p^^^| for all e G E{X). This gives an expected decrease of cost(A') / (2|r(A') |) in the 
LP solution after scaling down, and the expected cost of the contracted component is cost(A')/|r(Af)|; so 
this implies only an uninteresting bound of 2 on the integrality gap. Instead, we must choose the distribution 
more carefully. 

More precisely, we will choose a well-structured subset K C E{X) and only consider removal probabili- 
ties p G Brem whosc support is contained in K. The set K will be chosen to be a minimal subset of E^X) 
whose removal from E{X) disconnects all terminals in the blowup graph. We call such a set a splitting set^. 
Interestingly, the family of all splitting sets form the bases of a cographic matroid, since K is a splitting set 
precisely when E{X) \ K is a spanning tree in the graph obtained from X by contracting together all its 
terminals. As we will see more formally in the proof of Theorem 2.2, when choosing ii' to be a splitting 
set, the set Bq = {B G Bq \ B C K} is nonempty for every Q, and so form the bases of the matroid Mq 
obtained by restricting Mq to K. This implies that the polytope B^^ = {p ^ Brem \ supp(p) C K} of 
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removal probabilities we consider is nonempty, and thus forms the base polytope of the polymatroid obtained 
by restricting the polymatroid corresponding to Brem to K. 

Once we have chosen some splitting set K, we will call edges in K core edges, and all other edges 
cleanup edges. To see the reason for this name, recall that the mati'oid Mq describes only the minimal edge 
removals upon contracting Q. However, there may be other removals that are possible; for B G Bq, there 
may be pendant edges in (Af ® Q) — i? which can be removed without having any effect on feasibility. Our 
choice of K ensures that for any edge e € B{X^ \ K,e can be deleted ("cleaned up") once enough edges of 
K^C have been removed. But just as importantly, we can prove 

Theorem 2.2. If K is any splitting set, then there is a distribution over Bq for each Q G r(^Y) such that 
if Q is chosen uniformly at random from r(^), and then B is chosen from Bq according to the chosen 
distribution, then 

IP{e £ B] >N/\T{X)\ for each e€K. 
This is discussed further in Section 3. 

2.3 The algorithm 

For the accounting in our analysis, we will need to keep track of precisely which edges in E{C) n K must be 
removed before an edge e G E{C) \ K can be deleted (cleaned up). Define W{e) C K, the witness set of 
edge e, as the unique minimal set of edges such that after removing W{e), e becomes a pendant edge and can 
be cleaned up. The fact that there exists such a unique set is shown in Lemma B.l in the appendix. We also 
define W{e) = {e} if e ^ K. Figure 3 shows an example of a witness set. 

We define a weight (distinct from the cost) on all core edges in such a way that the total weight of core 
edges equals the total cost of Af , by charging the cost of a cleanup edge to the core edges in its witness set. 
More precisely, let 

w{e) = c{e) + Yl f"'"^!!"^^- 

The following is an easy consequence of Theorem 2.2 and the fact that XleGii' ^C^) = cost(A'): 

Lemma 2.3. Let K be any splitting set. There exists some component Q such that cost(Q) < w{B'^) / N , 
where B^ is a maximum weight basis of Mq. 

For a given Q, a maximum weight basis of Mq can be found via a greedy approach; all that is needed is 
an independence oracle. This we can obtain immediately from our understanding of the rank function of Mq ; 
it can be computed using submodular function minimization (see (4) in the next section). However, while 
polynomial time, this is quite slow. We can instead exploit the result that Mq is a gammoid, giving a much 
faster independence oracle based on solving a maximum flow problem; this is discussed in Appendix A. 

We are now ready to describe precisely our deterministic algorithm, given in Algorithm 1. In the 
algorithm, at each stage we choose a component Q and contract it (in the usual sense, yielding an instance 
with a smaller vertex set). Thus at intermediate stages of the algorithm, X will be a feasible blowup graph of 
some contraction of the original graph G. We also emphasize that the witness sets W{e), and hence also the 
weights w{e), depend on the blowup graph in the particular iteration. 

We now define, for any blowup graph X and splitting set K, a potential function ^k{''^) by 

^k{X):= c{e)H{\W{e)\\ 

e(iE{X) 

where H{tj := l + l/2 + -- - + l/£is the harmonic function. 
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Algorithm 1: A deterministic algorithm for Steiner tree demonstrating a ln(4) integrality gap. 
Input : Graph G with edge costs c and terminal set R, feasible blowup graph X, and splitting set K. 
Result: A Steiner tree T. 

T ^ 0. 

while T is not a Steiner tree do 

Find a component Q G T{X) and maximum weight basis B G Bq with cost(Q) < w{B) /N. 

Cleanup: Let F = {e ^ K \ W{e) C B}. 

Update: T ^TuQ, X ^ {X - B - F)/Q, K ^ K\B. 

end 



Theorem 2.4. For any minimal splitting set K and feasible blowup graph X, Algorithm 1 yields a solution 
of cost at most (^k{X)/N. 

The proof of this theorem (given in Appendix B) essentially boils down to showing that in a single step 
of the algorithm, the expected cost of the contracted component is no larger than the decrease in the potential 
function scaled down by Let Xt and Kt be the blowup graph and splitting set at iteration t of the 

algorithm, with Bt the selected removal set. We are able to show that <^Kt{^t) — ^Kt+A^t+i) > w{Bt), 
from which the theorem immediately follows. 

From this, we can use an averaging argument to show the ln(4) integrality gap bound. Essentially, if K is 
chosen randomly from the matroid of possible minimal splitting sets according to an appropriate distribution, 
it can be shown that 

¥.{^k{X)] < ln(4) • costCA-). 

It is also possible to minimize ^xi'^) as a function of K, via a dynamic program. The full proof can be 
found in the appendix: altogether we obtain, recalling cost(A') = N ■ cost(x). 

Theorem 2.5. For any solution x of (LP), and choosing K to minimize ^k{'^), Algorithm 1 returns a 
solution of cost at most ln(4) • cost(j;). 

We emphasize again that while we have described everything in terms of the blowup graph, it is possible 
to implement Algorithm 1 directly in terms of the LP solution, yielding a polynomial time algorithm. Details 
will be provided in the full version. 

2.4 Quasi-bipartite graphs 

The situation is much simplified in the case of quasi-bipartite graphs. In this case, we may choose K to 
consist of all edges except for the cheapest in each component. This clearly minimizes <I>if (A"), and it can be 
shown that 

Lemma 2.6. Let K = E{X) \ Emin, where Emi„ consists of a cheapest edge from every component. Then 

^k{X) < I . cost(A'). 

A 73/60 < 1.217 bound on the integrality gap immediately follows from Theorem 2.4. One of the major 
drawbacks of relying on (LP), or any of the hypergraphic LPs, is that solving them is computational intensive; 
in general, to obtain a 1 + e approximation, nothing better than n^"*^'''' time is known. This can be improved 
somewhat to n^^^/'^^ in quasi-bipartite graphs, but this is still very slow. We show how to sidestep this issue 
and obtain a reasonable running time for quasi-bipartite graphs by instead solving the much more tractable 
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bidirected cut relaxation, which has only O(n^) variables. Combined with the fact that we do not need to 
re-solve the LP in each iteration, we obtain a markedly faster algorithm than the one of Byrka et al. [3]. 

More precisely, we show how a solution to the bidirected cut relaxation can be transformed into a solution 
to (LP) with the same cost, via a natural greedy procedure. One step of the transformation consists of 
taking, from a star centered around a Steiner vertex, all arcs with incoming flow and one arc with outgoing 
flow. This yields one component for (LP); the capacities are then uniformly reduced on these edges and the 
process is continued. The details of this are given in Appendix D. Previously, [6] showed that the bidirected 
cut relaxation always has the same objective value as the hypergraphic relaxations, suggesting that such a 
transformation should exist, but the question remained open. 

3 Deeper into the matroid structure 

In this section, we discuss in more detail the heart of our arguments; uncovering the matroid structure of 
edge removals, and showing appropriate uniform removal probabilities after the random contraction of a 
component. 

In what follows, we will often need to refer to the terminal set of a component C, so we will again abuse 
notation and write, e.g., \ C\ for the number of terminals in C. Define : 2^ — )• N by 

hx{S) = N{\S\-l)- (|5nC|-l)+. (1) 

cer(A') 

It is immediate from (LP) that X is feasible if and only if 

hx{S) > V5 C i?, S / and hx{R) = 0. (2) 

Indeed, hx{S) is, up to scaling, simply the slack (or if negative, violation) of the corresponding constraint in 
(LP). Two important properties of hx are the following: 

Lemma 3.1. For any blowup graph X, 

i) hx is intersecting submodular, i.e., for any two sets Si, S2 ^ E{X) with SiCi S2 ^ 0, 

hxiSi U ^2) + hxiSi n S2) < hx{Si) + hx{S2), and 

ii) for any F C E{X) and$^S^R, hx{S) < hx-F{S) < hx{S) + 

Proof, i) This follows immediately from the fact that for any C <^ R, the function S*— T-dS'nC'l — l)+is 
intersecting supermodular 

ii) The removal of any additional edge e G E{X) from X leads to a split of some component C of X 
into subcomponents Ci, C2 with Ci n C2 = 0, Ci U C2 = C. Hence, 

hx-e{S) - hx{S) = {\s nc\- 1)+ -{\snCi\- 1)+ -{\snC2\- 1)+ g {0, 1}, 

which leads to hx{S) < hx~e{S) < hx{S) + 1. Applying this repeatedly yields the claim. □ 

An interesting consequence, that essentially follows by intersecting submodularity of hx and standard 
uncrossing techniques, is that any basic feasible solution to (LP) has a support of size bounded by — 1 
(see, e.g., [10] for an example of this reasoning). For an equivalent version of (LP), this result was already 
obtained through a rather involved technique by Chakrabarty et al. [6]. 

For convenience, define 

hp{S):=hx-F{S) = N{\S\-l)- (l'5nC|-l)+. 

C&[X-F) 
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The following lemma describes feasibility of {X ® Q) — F ina convenient form, and also shows that we 
need only consider constraints corresponding to subsets containing Q. 

Lemma 3.2. The blowup graph {X ® Q) — F is feasible if and only ifhp{R) = N{\Q\ — 1) and hp[S) > 
N{\Q\ - 1) for all Q. 

Proof Let X' = {X ® Q) - F. Then X' is feasible iff hx'{S) > for all S C R, S ^ 0, with equahty for 
S = R. But 

h;,,{S) = hp{S)-N{\SnQ\-l)+, (3) 

and so this can be equivalently stated as hp{S) > N{\SnQ\ - 1)+ for all 5/0, and hp{R) = N{\Q\ - 1). 

All that needs to be proved then is that only the constraints for 5 3 Q need to be considered. So suppose 
5 is a violated set: hx'{S) < 0. Then 5 n Q / 0, otherwise hx'{S) = hp{S) > hx{S) > by feasibility 
of X. But for any such S, 

iV(|5|-i)-iV(|5nQ|-i)+ = iV(|5ug|-i)-iV(|Q|-i)+ 

andclearly ^ (|5nC|-l)+< ^ (|(5 U Q) n C| - 1)+. 

C&V{X-F) C&T{X-F) 

Subtracting and using (3), we obtain that /i;t'/(5U(5) < hx'{S). Since 5 was a violating set, so is 5 UQ. □ 
Let E{X) be any subset of E{X), and define rq : E{X) ^ N by 

rQ{F) = mmhp{S). (4) 

We will show: 

Proposition 3.3. The function rq is the rank function of a matroid of rank N{\Q\ — 1). 

Once we have this, it is straightforward to show that this matroid precisely describes the minimal edge 
removals: 

Theorem 3.4. The set of bases of the matroid defined by rq is precisely Bq. 

Proof. Let Bq be the set of bases of the matroid defined by rq, and consider any B e Bq. By the definition 
of rq, we have that 

hsiS) > rqiB) = N{\Q\ - 1) for any S^Q. 
Moreover, by Lemma 3.1 (ii), 

hs{R)<h;^{R) + \B\=N{\Q\-l); 

the final equality follows since \B\ = rq{B) = N{\Q\ — 1) and hx{R) = by feasibility of X. Thus by 
Lemma 3.2, {X ® Q) — B h feasible. 

Conversely, consider any B G Bq. By feasibility and Lemma 3.2 again, hi^{S) > N{\Q\ — 1) for all 
S ^Q, with equality for S = R. Thus rQ(5) = A^(|Q| - 1), and so there is some B' C B with B' £ Bq. 
But then B' is also a feasible removal set by the above, and so by minimality B' = B. □ 

Proof of Proposition 3.3. First, observe from (1) applied to the empty blowup graph that 

rq{EiX)) = miniV(|5| - 1) = 7V(|Q| - 1). 

o — JCj/ 
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We must show that vq is increasing, submodular, and satisfies rQ{F) < \F\ for all F C E{X). The fact 
that rg is increasing follows immediately from the definitions of rg and hp; removing a larger set can only 
increase the slack. Considering some fixed S D Q,we have by Lemma 3.1 (ii) that hx-F{S) < hx{S) + 
Thus rQ{F) < rQ{^) + \F\ = \F\ since rQ(0) = by feasibility of X. 

Now we come to the main part of the proof, showing that rg is submodular. We must show that for any 
Fi C ^2 ^ E{X) and e ^ F2, 

rgiFi + e) - rQ{Fi) > rqiF^ + e) - rgCFa). (5) 

It is clearly sufficient to show this for Fi and F2 differing by a single edge. Consider any S D Q and 
i G {1,2}. The difference 

h^eiS)-hp^{S)= Yl i\cns\-i)+ - (|cns|-i)+ 

C&r{X-F,) CGr{X-iF,+e)) 

is one or zero, and it is one precisely if e splits up some component C G r{X — Fi) into two components 
Ci, C2 that both intersect S. If this is the case for some component in r{X — F2), then e will also split up 
some component mT{X — Fi) into two pieces both intersecting S, since — F2 is a subgraph X — Fi. 
Thus for any S ^ Q, 

hp^,{S) - hp^{S) < hp^^iS) - hp^iS). (6) 
It also follows that for any S,S' <^R with Q C S C S", 

hp^,{S) - hp^iS) < V— - hp^{S'). (7) 

Let Si be the set of terminal subsets containing Q that minimize hp_{S), over all S ^ Q. Since hp^ 
is intersecting submodular by Lemma 3.1, there is a unique maximal set S\ G 5i, meaning SI ^ S for 
all S ^ Si. Similarly, there is a unique minimal set 5*2 G ^2; so 5*2 C 5 for all 5" G ^2. We first show 
8*2 C SI- Notice that for 5" D Q with 5" ^ cSi we have hp^{S) > hp^{S) > hp^{Sl) + 1, where the first 
inequality follows by Lemma 3.1 (ii). Furthermore, hp^{Si) < /i^^ (5^") + 1, again by Lemma 3.1 (ii). Hence 
hp^{Si) ^ hp^ (S) VS ^ Q, S ^ Si, and thus Si must contain some minimizers ofhp^, i.e., 5i n ^2 7^ 0. 
Since S2 is the minimal set in ^2 and SI is the maximal set in Si we obtain 51 C S'J'. 

We finally have 

by (6) 

by (7), since S^ C SI 

□ 



rQ{F2 + e) - rQ(F2) = hp^^{S*2) - hp^{S*2) 
<hp^e^Sl)-hp^{Sl) 
<hp^^{Sl)-hp^{Sl) 
= rQ{Fi+e)-rQ{Fi). 



Proof of Theorem 2.2 

We first show that for any component Q, 

rQ{K) = N{\Q\-^)- 

This in turn implies that Bq = {B G Bq \ B C K} is nonempty, since the rank of Mq is A^(|Q| — 1) by 
Proposition 3.3, and so Brem 7^ 0- Notice that for S R, S ^ %, vie have hxiS) = N{\S\ — 1), since in 
X — K a\\ components contain precisely one terminal. Hence, 

rQ{K) = mmhK{S) = N{\Q\-l). 
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As already discussed in Section 2, the polytope Brem is simply a weighted Minkowski sum of the base 
polytopes B{Mq) for Q G r(A'). It is well known that the Minkowski sum of matroid polytopes is a 
polymatroid, and moreover, the rank function of the sum is simply the sum of the rank functions of the 
summands [14]. Thus, Brem is the base polytope of a polymatroid with rank function 

' ^ ^' Qer(A') 

To show that the point p given by = A^/|r(^)| for all e G K is in Brem, we need to show that 
r{F) > |F| • A^/|r(A')| for every F C K. Expanding out (8) and the definition of rq, and writing Sq for 
the subset S ^ Q that attains the minimum in (4), we obtain 

We now observe that because K is a splitting set, hp{R) = \F\. For imagine removing the edges of F from 
X one by one; hx-F{R) — hx{R) just counts the number of times where a component is split by the deleted 
edge in this process. But by the nature of minimal splitting sets, this must happen at every step — no pendant 
edges are formed at any stage. Hence hp{R) — hx{R) = |^|; moreover, hx{R) = by feasibility, so indeed 
hp{R) = Thus to finish the proof, it suffices to show 

Claim 3.5. Y.Qev[x) ^FiSq) > N ■ hp{R). 

To prove Claim 3.5, we replace the function hp on the left-hand side of the inequality by a function 
/ that lower bounds hp and is well structured. More precisely, / is chosen to be a conic combination of 
a special type of intersecting submodular functions which we call partition functions: for any partition 

V = {Pi, . . . , Pn} of R, the corresponding partition function f-p is given by 

fv{S) = G N I P,- n 5 / 0}| - 1)+ V5 c R. 

The following theorem (whose proof can be found in Appendix C) guarantees the existence of the function / 
that we need to prove Claim 3.5. 

Theorem 3.6. Let /i : 2^ — )• ]R_|_ any nonnegative intersecting submodular function with = Q for all 

V G U. Then there is a monotone intersecting submodular function f of the form 



i=l 



for some k gN, where Aj > and "P* is a partition of U for each 1 < i < k, satisfying: 

i) f{S) < h{S) for all S <^ U, and 

ii) f{U) = h{U). 

Consider the function h~p defined by h~p,{S) = max{hp{S), 0}. Then h'p differs from hp only on the 
empty set, since hp{S) > for all S ^ $. Thus /i^ is still intersecting submodular, and also nonnegative. 
Let / = '}2!i=i Kfv^ be the function obtained by applying Theorem 3.6 to h'p. We then have 

E hp{SQ)> Yl /(-^q)^ E fiQ) 

Qer{x) QeT(x) Q(^r(X) 

k k (9) 

= E E^^/^»(^) = E^i E 

QeT(x) i=i 2=1 Qerix) 
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where the first inequahty holds since hp{S) = h~p,{S) > f{S) for all 5 7^ 0, and the second inequality holds 
since / is monotone and Q ^ Sq. 

As observed by Chakrabarty et al. [6], any solution x to (LP) satisfies the following partition constraints 
for any partition V of R: 

^cMC) > \v\ - 1, 

ce/c 

where \ V\is the number of sets in partition V. In our blown-up setting this translates into 

MQ)>N{\v\-i). 

Combining this observation with (9) and using \ Vi\ — 1 = f'p^{R), Claim 3.5 follows since 

k k 

Qer(A') i=i Qer{x) i=i 

k 

= N-Y, A.M(i?) = N ■ f{R) = N ■ hp{R), 
where the last equality follows from property (ii) of Theorem 3.6. 
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(a) fractional solution x (b) blowup graph X 



Figure 1: In (a): fractional solution x (components drawn in different gray scales and labelled with their capacity xc)- 
In (6): blowup graph XioxN = 2. 




(a) blowup graph X %Q (b) blowup graph X®Q—B after removal 



Figure 2: In (a): X ®Q, edges in B C E{X) in bold, copies of Q are dotted, terminals in Q are filled gray. In (6): 
feasible blowup graph X ®Q — B (which is not minimal due to the pendant edge that may also be removed). 




Figure 3: Illustration of the definition of W{e). Depicted is some component C with core edges K (soUd) and cleanup 
edges (dashed). 
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A Separation and gammoid structure 



In this section, we investigate the separation problem for (LP). Although it is not necessary, for convenience 
we will work in the blown up formulation; thus, for a given X, our goal is to determine whether (2) is satisfied 
(see Section 3 for details of this and the definition of h^)- In fact, we will do more; for any Q R, Q 
we will find the most violated set over all S ^ Q. Given this, we can answer the separation question by 
checking that min^^ j^j hx{S) is zero for each choice of v (note that hx{%) = —N, and so we must exclude 
this trivial set from consideration). For each choice of v, one max-flow calculation will be required. 

The construction is inspired by one for the forest polytope [17, 16] (see also [22, §51.4]). While 
what follows is not precisely a generalization (in the case where all components have size 2, the resulting 
construction is slightly different), it is similar in spirit. In the directed component-based relaxation, separation 
via an equivalent flow-based formulation is completely straightforward. However this does not imply such a 
formulation for the undirected version. 

Since hx is an intersecting submodular function, it follows already that vaSns^Q hx{S) can be computed 
in polynomial time [22], using submodular function minimization as a black box. However, the combinatorial 
algorithm we demonstrate here, which reduces the separation problem to a max-flow calculation, gives some 
additional insights (as well as being more efficient). 

Let X be the blowup graph of some solution x. First, let 

Vv = \{C G r{X) :veC}\-N forve R. 

If Uv is negative for any v, it is easily seen that X is not feasible (it corresponds to x{6{v)) < 1). So from 
now on, we assume Uv > for all v ^ R. Construct a directed multigraph D = {W, A) (we will write Dx if 
we wish to be explicit on the choice of X) as follows. Begin with the multigraph X, and for each component 
C G r(Af), pick an arbitrary node rc as the root. Adjoin a source node s and sink node t. Now orient all 
edges of E{C) away from rc for each component, and adjoin the arcs src for each C G r(Af), and vt for 
each V £ R. We assign capacities z to the arcs; z{vt) = for all v £ R, and z{a) = 1 for all other arcs. 

Theorem A.l. For any nonempty Q Q R, the value of the maximum s-{Q U {t})-flow in D is equal to 
y{R) + N + uiinsDQ hx{S). More specifically, ifU* is a minimum (Q U cut in D with s ^ U*, then 

S* = U* n R minimizes hx{S) over S ^ Q, and 

z{6-{U*))=y{R) + N + hx{S*)- 

Proof. Use E{C) to denote the arcs in D corresponding to E{C) in X, and let A{C) = E{C) U {src}. For 
any S" C i?, let u{S) C 1^ be defined by 

v{S) = S^{t} U u {y{C)\R). 

cer(A'): 

Claim A.2. For any nonempty S <^ R, z{6~ {i^{S))) = hx{S) + y{R) + N. Moreover, for any U C W 
with s ^U,t £U andUr\R = S,we have z{5-{U)) > hx{S) + y{R) + N. 

Proof Consider some component C G T{X). If 5 fl C = 0, then clearly 6^ {v{S)) n A{C) = 0. So suppose 
5 n C / 0. If rc G v{S), then clearly 6' {u{S)) n E{C) = 0, and src G -5" {v{S)). On the other hand, if 
rc ^ i^{S) (implying in particular that rc is a terminal), then src ^ ^nd \6~{v{S)) fl E{C)\ = 1, 

since all terminals are leaves of the components they belong to. In either case, z{6^ {v{S)) n A{C)) = 1. 
For any v £ R, sv £ 6^ (^("S*)) if and only ifv^S. Putting this all together, 

zi6-{,^{S))) = \{C G T{X) :CnS^9}\ + Y.y- 
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Now taking (1) and adding and subtracting y{S), we have 

h;,iS) = Ni\s\-i)- {\cns\-iy+l \c n S\ - N\S\] - y{S) 
C£r{x) \cer(A') / 

= \{C G T{X) :Cn5/0}|-iV - y{S) 

= z{6~{u{S))-N-y{R). 

Now consider any U with t ^ U, s ^ U and U H R = S. We again clearly have sv G S~{U) for all 
veR\U, and again S~{U) D A{C) / if 5 n C7 / 0. So z{6-{U)) = hx{S) + y{R) + N. □ 

By this claim, z{6^ {u{S*)) < z{5^ {U*)); since U* is a minimum cut, we must have equality. Then 
again by the claim, 

z{5-{v{S*))) = hx{S*) + y{R) + N. □ 



We now show how this leads to a description of the matroid Mq as a gammoid. Recall the definition 
of a gammoid: a directed graph H is given, along with two subsets X,Y Q V{H). The groundset of the 
gammoid is X, and a set / C X is independent if there are vertex-disjoint paths from / to some subset of 
Y. We say in this case that this defines the gammoid /rom X to Y in H. It is convenient to observe that 
by transforming the digraph H appropriately, we can replace vertex-disjoint in the above definition with 
arc-disjoint, and still characterize gammoids. 

We need to slightly tweak the digraph D defined above. For each / G E{X), there is a corresponding arc 
a in D. Split the arc by adding an additional node vj, producing a "front" arc with tail vj and a "back" 
arc with head vj. We may also remove the node s and all its adjacent arcs. Call the resulting modified 
digraph D'. 

Define the sets 

X = {vf \ f € E{X)}; X' = \J rc and Y = Qu{t}. 

cer(A') 

Let 0Q be the gammoid defined on D from X' U X to Y, requiring arc-disjointness rather than vertex- 
disjointness. Then define ©q = 0q/X'; this contraction is also a gammoid. By the one-to-one correspon- 
dence between X and E{X), we may consider this is a matroid over E{X). 

Theorem A.3. For any component Q, &q = Mq. 

Proof. The rank of a set [/ C X in &q is p{U) = p'{U U X') — p'{X'), where p' is the rank function of 
&Q. Notice that the maximum number of arc-disjoint paths from X' to Y is precisely the max -flow from s to 
Q U {t] in D. Thus by Theorem A.l, and the definition of rq, p'{X') = rQ{%) + y{R) + N. 

Now p'{U U X') is the maximum number of arc-disjoint paths from U VJ X' to Y. But imagine what 
would happen to p'{U U X') if the arcs Ajj = {a^ \vf^U} were removed from D' . Take Pi, . . . , to be 
any maximum collection of arc-disjoint paths from U VJ X' toY in D'. For some Vf ^ U,if some path Pi 
uses arc a^, then certainly no other path emanates from vj, and so we can simply remove the initial segment 
of Pi before Vf to obtain another maximum collection of disjoint paths that do not use Oj. Repeating this 
process, we obtain paths P[, . . . , P^ that do not use any arcs in Aij. But taking D' — Au, and contracting all 
of U to form the source, yields precisely Dx-f, the digraph for the separation construction corresponding 
toX - F. Thus again by Theorem A.l, p'{U U X') = rQ{U) + y{R) + N. Thus p{U) = rQ{U), and so 

= Mq. □ 
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B Proofs for Section 2.3 



Let us fix a component C S and a splitting set K. By the definition of K (as the complement of a 

spanning tree in the graph C obtained by contracting terminals), every Steiner node u G V{C) \ C has a 
unique path Pu ^ E{C) \ K of cleanup edges to a terminal that we term r„ G C (see again Figure 3). 

Lemma B.l. For any splitting set K C E{X), component C G r(^Y) and edge e G E{C) \ K, let 



Then W{e) is the unique minimal subset of E[C) n K whose removal makes e a pendant edge. 

Proof. Let W C E{C) n K be any subset of splitting edges. If uv G W{e) \ W then e remains on a path, 
namely U U P^, C E{C) \ W between the terminals, implying that e is not pendant. Thus, any subset 
W which makes e pendant must contain W{e). 

On the other hand, we claim that e is pendant in E{C) \ W{e). To see this, let e = ab with e G Pa. For e 
to be pendant, there would need to be a path P that does not contain e from a to a terminal. But the first edge 
inK onP must be in W{e), contradicting the fact that P C E{C) \ W{e). □ 

Theorem 2.4. For any splitting set K and feasible blowup graph X, Algorithm 1 yields a solution of cost at 
most <^KiX)/N. 

Proof. We prove the theorem by showing that the decrease in the potential at any iteration is lower bounded 
by the weight of the edges we remove. More formally, consider a given iteration t with current blowup 
graph Xt, splitting set Kt, and weights wt. let Qt be the component to contract and Bt G 13 the edges to 
be removed from Xt in this iteration. At the end of iteration t a new blowup graph Xt+i is obtained with 
splitting set Kt+i = Kt \ Bt. We will show 



This in turn implies the theorem since the potential function at any iteration, and in particular at the end of 
the algorithm, is nonnegative. Therefore, the total weight of all core edges being removed throughout the 
algorithm is upper bounded by the potential value of the initial blowup graph, i.e., J2t '^t{Bt) < ^k{^)- 
Furthermore, since at every iteration, Qt and Bt are chosen such that cost{Qt) < wt{Bt)/N, we obtain that 
the cost of all contracted components — which is the cost of the Steiner tree our algorithm returns — can be 
upper bounded by cosi{Qt) < ^ Ylit '^t{Bt) < ^k{'^) /N, as desired. Hence, it remains to prove (10). 

For any edge e G Aft, we denote by Wt{e) its witness set at the beginning of iteration t. For simplicity, we 
define Wt+i on all of E{Xt), defining Wt+i(e) = for e G E{Xt) \ E{Xt+i). By definition of the witness 
sets, we have 



W(e) = {uv£Kn E{C) I e G P„}. 



^M)-^K,U^t+i)>wt{Bt). 



(10) 



Wt+i{e) = Wt{e) \ Bt for any e G E{Xt). 
Expanding the left-hand side of (10), we obtain 



(11) 



^KAXt)-^K,^AXt+i)= Yl cie){H{\Wt{e)\)-H{\Wt+i{e)\)) 



\Wtie)\ 




eeEiXt) fc=|Wt+i(e)|+l 




(12) 
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Furthermore, by expanding the right-hand side of (10) using the definition of the weights wt, we obtain 



eem(/) 



E E 



c e 



Inequality (10) finally follows by combining (12) with (13). □ 



In the following, we show that K can always be chosen s.t. < ln(4) • cost (A'), following the 

proof of [4]. For the sake of a simpler exposition, we replace every Steiner node in X of degree higher than 3, 
with a binary tree consisting of cost zero edges in order to obtain nodes that have degree exactly 3. Suppose 
we find a suitable pair (K, F) of splitting and cleanup edges in this auxiliary graph. Then every Steiner node 
u in the original graph has potentially several paths Pi , . . . , Pg C F of cleanup edges to terminals. We keep 
the one path minimizing c(Pj) and discard the first edge of all other paths. This does not increase (^k{^)- 
Applying this iteratively, we end up with a feasible pair of cleanup edges and splitting edges. 

From now on, we assume that every component C G r(A') is a binary tree. We pick an arbitrary edge 
6(7 G E{C) as root edge. From any interior node u G V{C) \ C, there are two outgoing edges (these are the 
edges that do not lie on the path from u to the root edge). We randomly pick one of these edges as cleanup 
edge and the other one as splitting edge. In other words, every interior node u has a unique path of cleanup 
edges to some terminal and hence, is a legal splitting set. Moreover, for every non-root edge e one has 
P{e eK} = l. 

Lemma B.2. If E{X) is chosen randomly according to the above distribution, 

¥.{^k{X)] < ln(4) • cost(Af). 

Proof. Fix a component C and an edge e G E{C). It suffices to show that E{i/(|l^(e)|)} < ln(4). The root 
edge is always a splitting edge, thus | Vl^(ec)| = 1- So, let e be a non-root edge and let vo,vi, . . . , v^^i be 
the path from e to the root edge, i.e. vqvi = e and VkVk+i = ec- Let 

X := max{i | vqvi, viV2, ■ ■ ■ , Vi-iVi G E{C) \ K} 

be the number of consecutive cleanup edges on this path, starting from e (and X = if already vqvi G K). 
Then F{X = i} = for i < and F{X = k} = (i)'=. Furthermore \W{e)\ = X + 1 if X < A; and 

|VF(e)| = k otherwise. We calculate 

k-l 

E{H{\W{e)\)} < ^P{X = i} ■ H{i + 1) + ¥{X = k] ■ H{k) 

1=0 

<E«(* + i) ( ) 

i=o ^ ' 

= ln(4). 



□ 
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The above argument can be derandomized by the method of conditional expectations, and this leads to a 
proof of Theorem 2.5. Another option is to observe that the best choice of K can be found in polynomial time, 
via a dynamic program as is indicated below. Combined with the above lemma, this implies Theorem 2.5. 

Lemma B.3. A splitting set K minimizing <I>/^(Af) can be found in polynomial time. 

Proof. Since the potential function can be decomposed into terms corresponding to each component, and a 
splitting set K consists of the union of splitting sets in each component, it suffices to consider each component 
separately. Hence, let C be any fixed component with vertices V{C) and edges E{C); our goal is to find a 
splitting set K for C that minimizes Yle&E(c) 

As usual when applying dynamic programming to problems on trees, we start by computing tables (to be 
specified soon) for subtrees consisting of a single terminal, and successively combine those tables until a table 
for the full tree is obtained, revealing the optimal splitting set. To specify the order in which we create tables 
for larger subtrees from smaller ones, we direct the edge of the tree C away from an arbitrarily chosen node 
in V{C). We consider the following type of subtrees that we call partial trees. For any vertex r € V{C) and 
subset U C 5+(r) of arcs leaving r, the partial tree Tjj with root r is the induced subgraph of C consisting 
of r and all vertices that can be reached from r with paths starting with one of the arcs in U. To simplify 
notation we also use Tu to refer to the edge set of the partial tree. Furthermore, let Tu = E{C) \ Tjj, and let 
Rtu ^ R denote the terminals contained in the partial tree Tu. 

To better understand what information should be stored for a partial tree T, we first briefly discuss how the 
choice of splitting set K within T impacts the witness sets in T, and vice versa. We will refer to the choice of 
core and cleanup edges (i.e., the choice of K) within some subset of edges as a configuration for that subset. 
We distinguish two ways that the root r of T can be connected to a terminal through cleanup edges: case (A) 
through a path within the partial tree T, and case (B) through a path outside of T. Correspondingly, we call a 
configuration for T a type (A) configuration if case (A) holds for the root of T, and a type (B) configuration 
otherwise. Notice that in a type (A) configuration, every node within T is connected to a terminal in Rt by 
cleanup edges. For a partial tree T we will store two tables, one corresponding to case (A) and one to case 
(B). 

Consider case (A) and let P be the path of cleanup edges connecting a terminal in Rt to r. Notice that in 
this case W{e) C T \/e G T. Hence, the configuration for T does not have any impact on the contribution of 
the edges of T to the function X]ee£;(C) However, the witness sets of the edges in P depend 

on the configuration for T, namely every core edge that can be reached within T from r by following cleanup 
edges is part of the witness set of any e £ P. Hence, the only information about the configuration for T that 
matters in finding an optimal configuration within T is the number a of core edges in T that can be reached 
from r through cleanup edges. Thus for case (A) we want to store a table for T which contains, for each 
value of a G {0, ... , |T|}, a corresponding type (A) configuration that minimizes X^gg^^ CeH{\W{e)\). Here, 
|VF(e)| can be computed without knowing the precise configuration in T (apart from a) since 



\W{e)\ 



\W{e)nT\ ife £T\P, 

\W{e)nT\+a ifeeP. 



Now consider case (B) and let P be the path in T connecting a terminal to r. In this case the situation 
is reversed and W{e) C T for any e G T. Hence, the configuration for T does not have any impact on the 
contribution of the edges of T to the function Yle&E{c) However this time, the witness sets 

of edges on P depends on the configuration for T, namely every core edges that can be reached within T 
from r by following cleanup edges is part of the witness set of any e £ P. Hence, the only information 
that has to be stored for T in case (B), in order to describe how the configuration within T impacts the 
configuration outside of T, is the number /3 of core edges in T that can be reached from r through cleanup 



19 



edges. Hence, for case (B) we want to store a table for T which contains, for each value of /3 G {0, . . . , |r|}, 
a corresponding type (B) configuration that minimizes X^eeT ^{^)^{\^{^)\)- 

Clearly, if we can compute the (A) table for the full component C, then we are done, since the globally 
best configuration is the one minimizing the potential function over all values of a. Computing type (A) 
and (B) tables for partial trees corresponding to single terminals is trivial: table (A) contains one entry 
corresponding to a = of value zero, and table (B) is empty. There are two constellation we exploit to 
compute tables for larger partial trees based on the tables of smaller ones. 

The first constellation is the following. Assume that we have tables (A) and (B) for two partial trees 
Tjj-^ and Tu^ with C/i n [/2 = 0, and both having root r. Then we can compute the two tables for Tjj-^\jU2 
from the tables of Tjj^ and Tu^ . This can be done by considering all legal combinations (meaning pairs 
of configurations that can be completed to a splitting set) of one table entry corresponding to Tu^ and one 
corresponding to Tu^, keeping the best ones. Since the size of each table is polynomially bounded in the 
input, this can be done efficiently. We skip the somewhat tedious details for combining those tables which are 
based on standard arguments. 

In the second constellation, we consider a vertex r and one of its out- neighbors v, i.e., there is an arc 
directed from rtov, such that both tables for T^+(^,) have already been computed. We can then compute the 
two tables for Tj^^j by considering all legal combinations of an entry of one of the tables of T^+^^.|^^^ and the 
two possibilities of rv being a core edge or a cleanup edge. 

It is easy to observe that starting from the terminals and leveraging the above two update rules, one can 
construct both tables for the full component C efficiently. 

□ 

For the following Lemma, we assume that the graph G is quasi-bipartite. 
Lemma 2.6. Let K = E{X)\ Emm, where Emin consists of a cheapest edge from every component. Then 

^k{X) < i • cost(A'). 

Proof. Consider a component C, which now is a star with edges ei, . . . , e^. Assume minimizes the cost, 
then the splitting edges in C are K r\C = {ei, . . . , efc_i}. First of all, K is obviously a legal splitting set. 
Secondly \W{ei)\ = 1 for i G {1, . . . , A: - 1} and \W{ek)\ = k-l. Thus 

k 

^ c(e) • H{\W{e^)\) <{k-l + H{k- 1)) • < ^ • cost(C), 



1=1 



using that 1 + ^^'^ ^ is maximized for k = 5. The claim follows by summing over all components 



C A lower-bound property of nonnegative intersecting submodular func- 
tions 3.6 

The main goal of this section is to prove Theorem 3.6. Before presenting the core part of the proof we discuss 
some basic properties of partition functions, and make some general observations concerning the statement 
of Theorem 3.6 which are useful to understanding its proof. 

Let U he a finite set. We recall that C 2^ is called a lattice family if it is closed under unions and 
intersections. A function J" M is submodular on T if f{A U 5) + f{A D B) < f{A) + f{B) for all 
A,B^F; supermodular on F, intersecting supermodular on T etc., are defined similarly in the obvious 
way. 
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Any partition V = {Pi, . . . , P„} of [/ induces naturally a lattice family F-p (12^ which consists of all 
possible unions of sets in V. Consider the coverage function a{S) = \{j G [n] | H 5 / 0}|, which is 
clearly submodular. Notice that we can write fviS) = {ci{S) — 1)+, and in particular f'p{S) = a{S) — 1 
for all 5 7^ 0. Thus f-p is intersecting submodular: for any A,B<^U with An B ^ ^, 

MA) + MB) = ia{A) - 1) + (a(P) - 1) 

> a{A UB)-1 + a{A D B) - 1 
= MA^B) + M^nB). 

Furthermore, it is easy to see that fp is intersecting supermodular on T-p. Hence fp is intersecting modular 
on V, i.e., fp{A) + fp{B) = fp{AU B) + fp{A n B) for any intersecting sets A,B £ Tp. 

By the above observation, the function / claimed by Theorem 3.6 is by construction intersecting 
submodular since all /pi are intersecting submodular. Similarly, / is monotone due to the monotonicity of 
fpi. We prove the following slightly stronger version of Theorem 3.6. 

Theorem C.l. Let /i : 2^ — t- M-|- any nonnegative intersecting submodular function, such that all maximal 
sets S U with h{S) = Oform a partition ofU. Then there is an intersecting submodular function f of 
the form 

k 

f = ^ \fpil 

i=l 

where /c < |C/| — 1, Aj > Vi G [k\, V^, . . . are partitions ofU that become coarser with increasing 
index, and f satisfies: 

i) f{s) < h{s) ys c u, 

ii) f{U) = h{U). 

Furthermore, the partitions "P* together with the coefficients Xi, and hence f, can be constructed efficiently. 

Notice that the condition in Theorem C.l stating that the maximal tight sets of h form a partition of U 
is equivalent to the property that the family of all tight sets of h covers U, due to the following uncrossing 
argument. If the tight sets of h cover U then so do the maximal tight sets; furthermore, for any two intersecting 
tight sets A,B OU, 

= h{A) + h{B) > h{A UB) + h{A nB)>0, 

by submodularity and nonnegativity of h; hence AU B is also tight. Hence, this condition is indeed weaker 
than the one used in Theorem 3.6, which states that all singletons must be tight. 

Proof of Theorem C.l. The partitions , . . . and coefficients Ai, . . . , Ajt defining / are obtained as 
follows. 

1. Let i = 1, /i^ = /i, and be the maximal tight sets with respect to h. 
1. While > 0: 

(a) Let \i G M+ be the maximum value such that 

h\s)-\iM{s)>Q yseTp.. 

(b) ^ - Xifp^■, let C J^j,, be the maximal tight sets with respect to 

(c) i ^ i + 1. 

We start by observing that each function encountered during the algorithm is intersecting submodular 
over Tpi-i (by convention we set "P" = 2^), and that indeed forms a partition of U. This can easily be 
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verified through an inductive argument. By assumption h is intersecting submodulai" over U, and V is a 
partition of U. The intersecting submodularity of = /i* — Xif-pi over J^-pi follows by the intersecting 
submodularity of /i* over T-pi and the intersecting supermodularity of f-pi over T-pi . Since is intersecting 
submodular over T-pi, the maximal tight sets of in Tpi thus again form a partition of [/. 

The suggested procedure can indeed be implemented efficiently. At any iteration i and for any fixed 
A > 0, finding the set S G Tpi minimizing h^{S) — Xfpi is a submodular function minimization problem. 
Hence, in step (2a), Aj can be found by using e.g. binary search, or by applying the parametric search 
technique of Megiddo [15]. 

Furthermore, since = for all the sets S G Tpi that are tight with respect to /i* — which are 

precisely the sets in "P* — we have Aj > in each iteration. By choosing Aj to be maximum in step (2a), there 
is at least one set S G Tpi that is tight with respect to but not /i*. Hence, \V^\ > \V'^\ > . . . , and the 
procedures will terminate. Let k be the index of the last A that was set in step (2a). Hence, h^^^{U) = 0, and 
■pfc+i _ |[/| Since we start with \V^\ < \U\ and the partitions coarsen at each step, this implies k < \U\ — 1. 
Additionally, point (i) of Theorem 3.6 clearly holds by the termination criterion of the while-loop. 

Hence, it remains to prove point (ii), which we prove by showing the following claim through induction 
from j = A; + 1 to j = 1, where j = I corresponds to the statement (ii): 

k 

h'{S)-^Xifp.{S)>0 V5GJp.-i. (14) 
i=j 

For j = k + l, (14) clearly holds, since h''+\S) = h''{S) - Xkfpk{S) > V5 G T^k, by choice of A^. 
Now let j G {1, . . . , /c} and assume that (14) holds for all values above j. Let S G Tp^-i, and we define 
S' G Tp to be the minimal set in Tj, that contains S, i.e., 

S':= U P- 

Notice that 

h^{S) = h^iS)+ Yl h\P)>y{S') (15) 
Pr\Syt9 

where the equality holds since all sets in are tight with respect to by construction, and the inequality 
follows by standard uncrossing ai^guments: for any set P G 'P-' , P n 5 7^ 0, we have h^{S) + {P) > 
h'^{S U P) by intersecting submodularity and nonnegativity of /i\ and thus the two terms h^{Si) and /i*(P) 
can be replaced by U P) and this procedure can be repeated. In other words, we simply exploit that 

any nonnegative intersecting submodular function has the subadditivity property for any family of sets that 
are connected when seen as hyperedges on the given ground set. 

The inductive step of the proof of (14) finally follows by 

k k k 

h^{S) -Y^XJp.{S) > h\S') -Y^X,fp.{S') = h^+\S') - > 0, 

i=j i=j i=j+l 

where the first inequality follows from (15) and the monotonicity of ^ifv^ i^), and the last one by the 
inductive hypothesis. □ 



22 



D Equivalence of the hypergraphic and bidirected cut relaxations in quasi- 
bipartite graphs 



Let G = {V, E) be a quasi-bipartite graph, where Steiner vertices are not connected by edges (i.e., we 



In words: we need to reserve enough capacity in order to support a unit flow from every terminal to the 
current root r. It was proven in [6] that in quasi-bipartite graphs, the value of (BCR(r)) coincides with the 
optimum value of (LP). This was done by lifting an optimum dual solution for the partition-based relaxation 
(which is equivalent to (LP) even in general graphs [6]) to a dual solution of (BCR(r)). However, the authors 
of [6] posed as an open question: for a given optimum bidirected cut solution, can a corresponding primal 
solution to (LP) be directly extracted without solving (LP)? We answer this question affirmatively. 

To avoid an unnecessary case analysis, we split direct edges between terminals by inserting a dummy 
Steiner vertex (we split the edge cost arbitrarily among the two parts). Let x be an optimum solution to 
(BCR(r)); then the natural decomposition is as follows. For a star with center u G V\R, take an arc (n, s) with 
positive outgoing flow and all arcs H = {{t,u) \ x{t, u) > 0; t ^ s} carrying incoming flow. Let e be the 
minimum capacity on any of these arcs. Then transfer this capacity into a component {s} U {t \ {t, u) S H}. 
Iterate this process until all capacity has been transferred. The main result of this section is: 

Theorem D.l. Let x be an optimum solution for (BCR(r)). Then the natural decomposition yields a feasible 
optimum solution for (LP) with the same objective value. 

Imagine that we want to "relocate" the root from r to another terminal r' . We can do this by considering 
the unit flow from r' to r, and reversing all capacity corresponding to this flow. This provides a feasible 
solution for BCR(r') that we term x*- which again has the same cost (see [11] for a proof). Note that for 
any {u, v} G E, the sum x'^'^\u, v) + x^'^\v, u) is independent of r. For a Steiner vertex u S V\R, let 
N{u) := {v I {u, v} G E} be the set of neighbours in the star with center u. It suffices to show Theorem D.l 
for basic solutions, since the decomposition of a convex combination of capacity vectors equals the convex 
combination of natural decompositions. By standard arguments, we may assume that the edge costs are 
distinct for all edges in the same star. 

Lemma D.2. In a star with center u and r G N{u) one has x^^\r, u) = and x^^\u, s) = Ofor each 
s £ N{u) with c{u, s) > c{u, r). 

Proof. The flow on arc (r, u) can be removed and the flow on {u, s) arc can be redirected to {u, r). Both 
operations would leave the solution feasible and decrease the cost, contradicting optimality. □ 

See Figure 4 for an illustration of the claim. 



have edges only between terminals and Steiner vertices or between terminals and terminals). Let E be the 
bidirection of E, i.e., for any {u, v} £ E, E contains arcs {u, v) and {v, u). 
The bidirected cut relaxation with root terminal r £ Ris 



mm 




e&E 



x((5+(5))>l yS QV\{r} : SnR^^ 
Xe>0 yee E 



(BCR(r)) 
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Figure 4: Arcs in the optimum solution x'--^^ that may carry positive flow. 

Next, we consider one iteration of the natural decomposition for a star with center u. For this reason, 
insert an extra Steiner vertex u into the graph, which has an edge {u, s} with s G i? iff there is an edge 
{u,s} G E with x^'^\s,u) > 0. For e = {u,s), we abbreviate e = {u,s) (see Figure 5). We define 
c(e) := c(e) and x^'^\e) = for all e G 5{u). Note that x^'^'^ is still an optimum solution. 

Lemma D.3. Let r := argmin{c{u,r) \ r G N{u)}, H := {{u,r)} U {{s,u) \ s G A''(ti)\{r}} and 
£ := minjx^''^ (e) | e G H}. Starting from x^^'\ transfer capacity of £ from all arcs e £ H to e and term the 
new capacity reservation x^'^\ Then the new capacity vector x^"^^ is a feasible optimum solution for (BCR(r)). 

Proof. We first show that the claim holds for some e > which is small enough. Consider any cut 
S C F\{r} and assume for the sake of a contradiction that x^^\d~^ (S)) < 1. For e > small enough, this 
may only happen if S was a tight cut before, i.e. x^'^\6^ (S)) = 1. Furthermore, any critical cut must contain 
at least two arcs of the form {s, u), i.e., \N{u) (1 S\ > 2. Pick r' := argmin{c(u, r') \ r' G N{u) R S}. 
According to Lemma D.2, the flow is x('')(e) = for e G U 6+{u))\H. Since x^''\6+{S)) = 1, the 

unit flow from r' to r needs all capacities on {s,u) arcs for s £ N{u) H S. Consequently, when relocating 
the root to r', the flow on all these arcs must be turned around completely. In particular x^^ ^ {u, s) > for 
s G iN{u) n 5')\{r'} contradicting Lemma D.2. 




(a) capacities x^''' (b) capacities x' 



Figure 5: Transferring capacity of e according to Lemma D.3. (a) visualizes capacity in x^'^\ where newly added edges 
e are dashed. (6) depicts a;''"' and a potentially critical cut S. 

We conclude that we can choose some e > s.t. x^^'^ is feasible. But the argument above shows that no 
cut S can become tight, thus the only limitation on e is the arc capacity. The claim then follows. □ 

We apply Lemma D.3 iteratively to all stars, adding copies of Steiner nodes as required, until we have 
a solution x* (with root r* e R chosen arbitrarily) where (i) every Steiner node has flow on at most one 



24 



outgoing arc, and (ii) the flow on all arcs of a star carrying a nonzero amount of flow is the same. Then x* 
induces a solution to the directed component-based relaxation: 

min ^ cost(C) • yc,s 
Ce/C,sec 

yc\s > 1 V0C5Ci?\{r*} 

C£lC,s£C:CnSj^9,s^S 

yc,s > VC G /C Vs G C. 

The solution y* corresponding to x* is obtained by setting, for each flow-carrying star with terminal set C 
and outgoing flow on arc (u, s), ^ = x*{u, s) (the common flow value in the star). All other components 
of y* are zero. It is easily checked that y* is feasible, and has the same cost as x* (and hence x). Then 
projecting to the undirected formulation, the vector {J2secyc,s)ceJC is feasible for (LP) (see [18]), and 
moreover corresponds precisely to the natural decomposition described earlier. 



E NP -hardness for solving the component-based relaxation 

It is well-known that there is a PTAS for solving (LP). In other words, for every fixed e > 0, there is 
a polynomial time algorithm that computes a feasible fractional solution to the considered hypergraphic 
relaxation (LP), which is within a 1 + e factor of the optimum fractional value. We argue now, that this is 
best possible (answering the posed question in [6]). 

Theorem E.l. Solving (LP) is strongly NF-hard. 

Proof. Let G = {V, E) be a complete graph with terminals i? = {si, . . . , s^,} C y, edge cost c(e) S {1,2} 
for all e £ E. Bern and Plassmann [1] showed that it is NP-hard to decide whether the cost OPT of the 
cheapest Steiner tree is at most a given parameter Z. 

We construct another Steiner tree instance G' = {V',E') as follows: For each terminal Sj G i? in 
the original instance, we add a teiTninal s^ and an edge SiS^ with cost c{si,s'^) := M with M := ?,v? 
and n = \V\. Furthermore, we downgrade the original terminal to an ordinary vertex, i.e., we define 
R' := {s'j I i = 1, . . . , /c} as terminal set. Let OPT'j be the value of the optimum fractional solution of (LP) 
for instance G' (using components of arbitrary size). 

First we show that OPT < Z ^ OPT'j < Z + k ■ M.het S* be the optimum integral Steiner tree in G. 
We add all Sis[ edges to S* and consider the emerging tree as component with fractional weight 1 and cost 
OPT + k-M. 

Next, we prove that OPT > Z + 1 ^ OPT'j > Z + I + k ■ M (which in turn implies the claim of 
the theorem). Let x be an optimum solution to (LP) in G' . For a component C G /C, we denote E{G) as 
the contained edges from the original graph (i.e. without Sjs'j edges) and by \G\ we denote the number of 
terminals. Either G contains less than k terminals, or cost (i?(C)) > Z + 1. In any case 

cost{E{G)) + M rZ + l + M M \ Z + l + M 
> min < , > > 



\C\-l - \ k-1 ' k-2j - k-l 
using that M = 3n^, k < n and Z < 2n. Now we can bound the cost of the fractional solution as 

OPT'j = {cost{E{G)) + \G\-M)- xc 
ceic 



= M -l)xc+ X](cost(^(C)) + M)xc 

C6/C ce/c 

> {k - 1)M + V + 1 + M)xc = Z + l + kM 
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exploiting XlceiC ^c{\C\ -l) = k-l. □ 

Observe that the above reduction in not approximation preserving. This is not surprising, considering the 
fact that Steiner tree even with edge weights {1, 2} is APX-hard (e.g. by a straightforward reduction from 
Set Cover with sets of size 3). 
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